Markov decision process - PDFSEARCH.IO - Document Search Engine

Markov decision process
Results: 537

#	Item
291	Exploiting Fully Observable and Deterministic Structures in Goal POMDPs Add to Reading List Source URL: www.ida.liu.se Language: English - Date: 2013-08-29 10:13:42 Stochastic control Partially observable Markov decision process Markov decision process Automated planning and scheduling Bayesian network FO S0 Finite-state machine Macro Statistics Dynamic programming Markov processes
292	Selecting the State-Representation in Reinforcement Learning Odalric-Ambrym Maillard INRIA Lille - Nord Europe [removed] Add to Reading List Source URL: eprints.pascal-network.org Language: English - Date: 2011-11-02 05:20:38 Markov processes Dynamic programming Markov decision process Stochastic control Distribution Multi-armed bandit Statistics Mathematical analysis Generalized functions
293	Feature Selection for Domain Knowledge Representation through Multitask Learning Benjamin Rosman Mobile Intelligent Autonomous Systems CSIR, South Africa [removed] Add to Reading List Source URL: www.benjaminrosman.com Language: English - Date: 2014-09-24 09:42:41 Artificial intelligence Reinforcement learning Q-learning Feature selection Prior probability Action selection Markov decision process One-shot learning Golden ratio base Statistics Machine learning Probability and statistics
294	Partially Observable Markov Decision Processes for Spoken Dialog Systems Jason D. Williams1 Steve Young Add to Reading List Source URL: mi.eng.cam.ac.uk Language: English - Date: 2013-10-14 15:42:14 Stochastic control Humanâ€“computer interaction Control theory Partially observable Markov decision process Dialog system Automated planning and scheduling Markov decision process Dialog Speech recognition Statistics Dynamic programming Markov processes
295	Natural Belief-Critic: a reinforcement algorithm for parameter estimation in statistical spoken dialogue systems F. Jurˇc´ıcˇ ek, B. Thomson, S. Keizer, F. Mairesse, M. Gaˇsi´c, K. Yu, and S. Young Engineering Depa Add to Reading List Source URL: mi.eng.cam.ac.uk Language: English - Date: 2010-11-01 08:12:32 Dynamic programming Stochastic control Markov models Expectation–maximization algorithm Partially observable Markov decision process Maximum likelihood Reinforcement learning Markov chain Normal distribution Statistics Markov processes Estimation theory
296	PARAMETER LEARNING FOR POMDP SPOKEN DIALOGUE MODELS B. Thomson, F. Jurˇc´ıcˇ ek, M. Gaˇsi´c, S. Keizer, F. Mairesse, K. Yu, S. Young Cambridge University Engineering Department ABSTRACT The partially observable Mar Add to Reading List Source URL: mi.eng.cam.ac.uk Language: English - Date: 2010-11-09 15:42:38 Estimation theory Expectation–maximization algorithm Maximum likelihood Partially observable Markov decision process Parameter Kullback–Leibler divergence Normal distribution Bayesian network Dialogue Statistics Statistical theory Bayesian statistics
297	Available online at www.sciencedirect.com Computer Speech and Language[removed]–174 COMPUTER SPEECH AND Add to Reading List Source URL: mi.eng.cam.ac.uk Language: English - Date: 2010-05-02 07:09:58 Stochastic control Partially observable Markov decision process Graphical models Probability theory Dialogue Probability Bayesian network Markov decision process Markov chain Statistics Markov processes Dynamic programming
298	POMDP-based dialogue manager adaptation to extended domains M. Gaˇsi´c, C. Breslin, M. Henderson, D. Kim, M. Szummer, B. Thomson, P. Tsiakoulis and S. Young Cambridge University Engineering Department {mg436,cb404,mh52 Add to Reading List Source URL: mi.eng.cam.ac.uk Language: English - Date: 2013-08-27 04:45:47 Partially observable Markov decision process Stochastic control Literature Domain Dialogue Speech recognition Protein domain Kernel Fiction Statistics Dynamic programming
299	Seminar Series 3108 Etcheverry Hall Berkeley Campus December 1, 2014 3:40pm - 5:00pm Archis Ghate Add to Reading List Source URL: www.ieor.berkeley.edu Language: English - Date: 2014-11-25 16:36:59 Statistics Mathematical optimization Linear programming Simplex algorithm Markov decision process Robust optimization Algorithm Operations research Mathematics Applied mathematics
300	Playing Atari with Deep Reinforcement Learning Volodymyr Mnih Koray Kavukcuoglu Add to Reading List Source URL: arxiv.org Language: English - Date: 2013-12-19 20:23:45 Computational neuroscience Cybernetics Reinforcement learning Q-learning Temporal difference learning SARSA Markov decision process Unsupervised learning Recurrent neural network Machine learning Neural networks Statistics

UPDATE